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1 . INTRODUCTION 

An eight bit microprogram CPU was looked into and was partially- 
designed and analyzed with different existing components. 
Thfi purpose of the above project was to aid us in arriving at 
a valid estimate, as far as speed, cost, packaging, and power 
dissipation for a microprogram 11/40 CPU with using different 
types of semiconductor components. The flow diagram used for 
«uch an implementation is shown in Figure 1. A breakdown of 
each of the sections drawn in the diagram as far as cost, 
technology, speed, pov/er dissipation, and packaging is analyzed 
in this report. Finally, a similar estimate on a microprogram 
11/40 CPU is made using today's components with some conclusions 
dravm. 
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Figure 1. 



2.0 BRIEF DESCRIPTION OP A MICROPROGRAMMED INSTRUCTION LOOP 

The microprograift loop consists basically of Read Only Mem- 
ory (ROM), an arithmetic unit plus shifters, and a scratch 
pad (R/Vl) memory (RAM) which can be a mixture of registers 
and fast semiconductor memories. This RAJ Memory is usually 
accessed by the POM or by the outside. 

The ROM's main function is to be the storage area for the 
microprogram instructions. Each WM. word can contain the 
source and destination addresses front and to the RAM, the 
next ROM address, the type of operation required, the shift- 
ing controls, and some test bits. The wider the word length 
of the ROM, the less control or address decodings are needed, 
and thus, the faster the cycle time. This loop consists 
most of the hardware required for a microprogrammed CPU, 
except for control logic. A detailed analysis of the ROM, 
RAM, and the arithmetic unit is discussed in the next sections 
of this report. 



3.0 READ ONLY MEMORY 

The bipolar ROMs seems to be the most attractive memories to 
use for our application today as far as speed, cost, and avail- 
ability. Different types of organizations are available from 
different vendors without any problems with second sourcing. 
The most popular organizations are the 32 words x 8 bits and the 
256 words x 4 bits, both of which have fully decoded addresses 
and open collector outputs for word expansion capability. 

Maximtjm read access time for the above devices is 60 ns and 
cost per bit is 5 cents using the large organization, and 10 
cents for the small one. Delivery time for a new mask is about 
three months. 

Both of the above units come in a 16 pins dual in line packages 
with 1 mw and 2 tm maximum power dissipation per bit for the 
large and small organization respectively. 

To avoid extra costs and time delays, a prototype machine using 
Read Only Memory can use RAM instead of ROM. The most popular 
bipolar RAM available in the market today, which is suited for 
such an application, is the 16 words x 4 bits organization. 
Properties of this type memory are discussed in the next section 
of this report. 



/i.O FAST R/W tlEMORY 

A read/write scratch pad meraory can be iifflplemented in many 
different ways with today's components. Bipolar semiconductor 
components are the only ones considered here because of its 
availability and cost, the table that follows gives a list of 
various bipolar components with their respective speed, cost, 
power dissipation, packaging, and expandability. The component 
underlined in the table is the one, I think, is best suited 
for our application. 



Table 1. 



Component 

No. 


Description 


IC Packs Req, 
for 16 bits 


Max Pwr 
Dis./Bit 


Speed 
Read Write 


Today ' s 
Cost/Bit 


Expandab i 1 i ty 


74H74 


2 D type flip-flop 
per package 


12 


120 row 


30 ns 35 ns 


$0.50 


16 vxn bits 


MC4015 


4 D type flip-flop 
per package 


8 


20 mw 


30 ns 40 ns 


0.44 


16 wxn bits 


DM8551 


4 D type flip-flops 
open collect, 
outputs 


4 


18 mw 


40 ns 55 ns 


1.00 


40 wxn bits 


74170 


4 w X 4 bits per IG, 


1 


10 mw 


55 ns 45 ns 


0.25 


1024 wxn bits 


Intel 














3101 


16 w X 4 bits per IC. 


1/4 


20 mw 


60 ns 40 ns 


0.37 


1024 wxn bits 


*Read time includes the address decoding time and read access time for the cell, while 
includes data set up time and write time. All of the components used, except for the 
afford some overlap between the read and write time. 


write time 
last one, can 
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The most attractive arithmetic unit available in the market 
today as far as performance, and cost, is the 74181 unit 
made hy Texas Instrument. A total of 32 different arith- 
raetic and logic operations can be achieved by this imit on 
two 4 bits words. These operations include add, subtract, 
complement f clear, set, increment, decrement, inclusive or, 
exclusive or, etc.-.^ It also provides a check if the two 
input vrards are equal. These units can be cascaded and used 
to perform operations on word lengths of up to 128 bits. 
However, word length effects the speed of the arithmetic 
operations, and a carry look ahead units (74182) can be used 
to speed up the operations. 

The table that follows gives the cost, maximum propagations 
time, and number of IC packages required for different word 
lengths, Maximiim power dissipation per bit using the above 
arithmetic jmits is 40 raw. 









Table 2 










Speed 


Number of IC Packages 


"~ 


Word Length 


Logic 


Operation 


Arithmetic Operation 


74181 


74182 


Total Cost* 


8 bits 




50 ns 


SO ns 


2 





14.00 


16 bits 




50 ns 


60 ns 


4 


1 


29,67 


32 bits 




50 ns 


90 ns 


8 


3 


€1,,01 


64 bits 




50 ns 


90 ns 


16 


5 


120.35 



* The 74181 comes in a 24 pins dual in line package at a current cost of $7.00. 
The 74182 is a 16 pins dual in line package and costs $1,67, 



SHJFTERF- 

For the nuiltiple levels cf shifting and rotating implementations/ 
discrete IC components are the recormiiendable components to use 
today as far as speed and cost„ IC coHiponents such as 74H53, 
74H60, and 74II62 represents one level of gating plus flexibility 

ill expansion. 

An average of two IC packages are reqiiired per bit for 16 differ- 
ent shifting and rotating operations with buffering at the out- 
put. Maximum propagation delay for a shifting operation plus 

buffering is 30 ns. 



Average cost for shifting per bit is aboat one dollar, and the 
coverage power dissipation is 150 imv* 



MICROPORGRAMI-IED IKETRUCTION CYCLE TIME 

The speed of a jnicroprograra instruction is heavily dependent 
on the technoloey used for implemsntatioB » With using the 
bipolar components iuentloned earlier, the inaxixniint cycle tixi'.e 
achieved is 180 ns i^hich is broken down into the following: 

1„ ROM Icitching time 20 ns max 

2=. Read access time for the R/W irieKiojry plus 

address decoding tirae 55 ns max 

3. Arithmetic operation time (16 tits words) 60 ns max 

4o Shift tine 30 ns max 

5o Write set xip time 15 ns max 

5, Skew tiiTie 10 ns max 



The read access tirae for the SOM is not accounted for in 
corap'ating the cycle tirae since the next RCT" address is usually 
contained in ROM iRstruction bits. Naturally, this access 
tirae should be added to the cycle tine at the start of the 
microprogram loop and lapon branching. In that case, the cycle 
tirae is delayed by 80 ns, AlsO/ note that by using the 74170 
components for the R/W memory, a gocd. deal of read and write 
overlap is achieved. 

Improvements on the cycle time can be raade in the future by 
utilising schottky gates in some parts of the loop such as. the 
shifters? the HOH latch netx-.-forki- and the R/T-J address decoders. 
By doing so? we can easily reduce the cycle time by 30 ns. 
Also, that the oniv effective way to reduce speed is to imple- 
ment all the loop? except the ROM^ by ECL logic where a 100 ns 
cycle tiiP.e can be achieved, Eowe-zer, this will tend to increase 
the nuiTiber of components and the cost by at least 30 percent. 
In addition to cost, there will be other problems we might have 
to face with s'ach as; system, noise ^ packaging, and single source 
items. Therefore y I recommend staying out for the time being 
frora using fast SCX logic, especially with our present system 
packaging schemes . 



8,0 COST ESTIMATE OF A MICR0PR0GRAI#1ED 11/40 CPU 

A breakdown of parts used in an 11/40 microprogrammed CPU is 
listed in the table that follows. Costs do not include wire 
wraps, juounting panels ^ systeEi checkout ^ or cables. 



Table 3, 





Sise 


No. of IC 
Packages required 


Pac]<-age Type 


Max Pvrr 
Dissipation 


,Cps 


±.__.™™ 


Logic 


1970 


JZmj^'- 


ROM 


512w X 48 bits 


24 


16 pins 


12W 


960 


450 


RM". 


32^? X 32 bits 


64 


16 pins 


low 


200 


120 


Ar it lime tic- 
Unit 


56 bits 


18 


14IC of 24 pins 
4IC of 16 pins 


2 « 5W 


105 


50 


General 
Purpose addr. 


16 bits 


4 


16 pins 


.5W 


12 


6 


General 
Registers 


10 X 16 bits 


80 


14 pins 


15W 


60 




Shifters 


56 bits 


100 


14 pins 


5W 


50 


30 


Control 
Logic 


— 


300 


mixed 


40W 


300 


11 h 


Line Drives 
and Receivers 


120 lines 


50 


mixed 


5W 


50 


30 


Terminating 

Resistors 


120 lines 


«-. 


„_. 


6W 


5 


./■ 


Etched 
Boards 


15 Quad» size 


— 


. — 


— 


450 


330 


Power Supply 


— 


.-- 


— 


— 


150 


120 


TOTAL 


,.„, _ 


640 


mixed 


96W 


2342 


1370 



9.0 CONCLUSIONS 

The following factors should be considered in deciding whether 
we should build a microprograinined or a conventional type 11/40 
CPU. 

Economy 

To luy knowledge, a microprogrammed 11/40 CPU can use at least 
10 percent less control logic than that of a conventionally 
built 11/40 CPU. Hov;ever, in adding the raicroprogram storage, 
the overall iriicroprograrnmed machine can now cost about 25 percent 
Hiore than the conventional one. This cost figure will change 
substantially vjithin the next two years ^ as it is estimated 
that the cost of the microprogram storage will drop by half by 
1972 to give the convential CPU a cost advantage of only 10 
percent over the microprogrammed one. 

■Speed 

Using the same type technology in both machines, a conventional 
11/40 CPU can go as fast as 120 ns/per machine cycle time com- 
pared to 180 ns for a microprogrammed instruction without in- 
cluding branching time. Thus, if a fast machine is required, 
an exclusive microprogrammed machine is not the way to go. 

Ease of Design and Flexibility 

A mi croprogranraied machine is known to be more flexible and easier 
to design. Also, as the computer instruction set grows the 
microprogrammed machine's logic external to the microprogram 
storage area needs not to increase while the control logic of 
the conventional machine increases. 

Packaging 

Again, a microprogrammed machine wins here since a large por- 
tion of it is amenable to use MSI chips. I estimate that a 
convential machine uses at least 10 percent more IC packages 
than that of a microprogrammed machine. However, this figure 
might drop if custom designed LSI chips are used two years 
from now. 



